TA2N: Two-Stage Action Alignment Network for Few-Shot Action Recognition

Abstract

Few-shot action recognition aims to recognize novel action classes (query) using just a few samples (support). The majority of current approaches follow the metric learning paradigm, which learns to compare the similarity between videos. Recently, it has been observed that directly measuring this similarity is not ideal, since different action instances may show distinctive temporal distributions, resulting in severe misalignment issues across query and support videos. In this paper, we address this problem from two distinct aspects: action duration misalignment and action evolution misalignment. We address them sequentially through a Two-stage Action Alignment Network (TA2N). The first stage locates the action by an affine transform, which warps each video feature to its action duration while dismissing action-irrelevant features (e.g. background). Next, the second stage coordinates the query feature to match the spatial-temporal action evolution of the support by performing temporal rearrangement and spatial offset prediction. Extensive experiments on benchmark datasets show the potential of the proposed method in achieving state-of-the-art performance for few-shot action recognition.
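As a rough illustration of the first stage mentioned in the abstract, the sketch below resamples a sequence of per-frame features along the time axis with an affine map. It is not the authors' implementation: the function name `temporal_affine_warp`, the normalized [-1, 1] time coordinates, the linear interpolation, and the hand-set `scale` and `shift` values are all assumptions made for this example. In the paper the transform is part of a network rather than fixed by hand.

```python
from typing import List, Sequence


def temporal_affine_warp(
    features: Sequence[Sequence[float]], scale: float, shift: float
) -> List[List[float]]:
    """Resample per-frame features along time with an affine map (hypothetical helper).

    Frame positions are normalized to [-1, 1]. Output position p reads the
    input at scale * p + shift, using linear interpolation between the two
    nearest frames. Positions outside the clip are clamped to its ends.
    """
    num_frames = len(features)
    if num_frames == 0:
        return []
    if num_frames == 1:
        return [list(features[0])]

    warped = []
    for i in range(num_frames):
        p = -1.0 + 2.0 * i / (num_frames - 1)      # normalized output position
        q = scale * p + shift                      # affine-mapped source position
        q = max(-1.0, min(1.0, q))                 # clamp to the clip boundaries
        x = (q + 1.0) * (num_frames - 1) / 2.0     # back to a fractional frame index
        lo = int(x)                                # x >= 0, so int() is floor
        hi = min(lo + 1, num_frames - 1)
        w = x - lo                                 # interpolation weight
        warped.append(
            [(1.0 - w) * a + w * b for a, b in zip(features[lo], features[hi])]
        )
    return warped


# Five frames with a one-dimensional feature each (toy values).
clip = [[0.0], [1.0], [2.0], [3.0], [4.0]]

# scale < 1 zooms into the middle of the clip, discarding the outer frames,
# which mimics cropping to an assumed action duration.
zoomed = temporal_affine_warp(clip, scale=0.5, shift=0.0)
```

With `scale=1.0` and `shift=0.0` the clip is returned unchanged. A smaller `scale` narrows the sampled window, and `shift` moves that window earlier or later in the clip.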

Similar resources

A Generative Approach to Zero-Shot and Few-Shot Action Recognition

We present a generative framework for zero-shot action recognition where some of the possible action classes do not occur in the training data. Our approach is based on modeling each action class using a probability distribution whose parameters are functions of the attribute vector representing that action class. In particular, we assume that the distribution parameters for any action class in...

k-Shot Learning for Action Recognition

In the problem of k-shot learning, a model must learn to reliably classify an example having seen only k previous instances of examples of the same class. With recent success in using memory in neural networks to perform k-shot learning, we propose a technique that uses Memory-Augmented Neural Networks to perform k-shot learning for action recognition in videos. We believe the use of memory will ...

Alternative Semantic Representations for Zero-Shot Human Action Recognition

A proper semantic representation for encoding side information is key to the success of zero-shot learning. In this paper, we explore two alternative semantic representations especially for zero-shot human action recognition: textual descriptions of human actions and deep features extracted from still images relevant to human actions. Such side information are accessible on Web with little cost...

One-Shot Learning for Real-Time Action Recognition

The goal of the paper is to develop a one-shot real-time learning and recognition system for 3D actions. We use RGBD images, combine motion and appearance cues, and map them into a new overcomplete space. The proposed method relies on descriptors based on 3D Histogram of Flow (3DHOF) and on Global Histogram of Oriented Gradient (GHOG); adaptive sparse coding (SC) is further applied to capture h...

One Shot Similarity Metric Learning for Action Recognition

The One-Shot-Similarity (OSS) is a framework for classifier-based similarity functions. It is based on the use of background samples and was shown to excel in tasks ranging from face recognition to document analysis. However, we found that its performance depends on the ability to effectively learn the underlying classifiers, which in turn depends on the underlying metric. In this work we presen...

Journal

Journal title: Proceedings of the ... AAAI Conference on Artificial Intelligence

Year: 2022

ISSN: 2159-5399, 2374-3468

DOI: https://doi.org/10.1609/aaai.v36i2.20029